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ABSTRACT 

We observe that the vocabulary used to construct the "an- 
swer" to problems in computer algebra can have a dramatic 
effect on the computational complexity of solving that prob- 
lem. We recall a formalization of this observation and ex- 
plain the classic example of sparse polynomial arithmetic. 
For this case, we show that it is possible to extend the vo- 
cabulary so as reap the benefits of conciseness whilst avoid- 
ing the obvious pitfall of repeating the problem statement 
as the "solution". 

It is possible to extend the vocabulary either by irre- 
ducible cyclotomics or by x n — 1: we look at the options 
and suggest that the pragmatist might opt for both. 

1. INTRODUCTION 

While sparse polynomials are a natural data structure for 
human beings (who writes :r 10 +0x 9 + 0:r 8 + 0£ 7 + 0:r 6 -|-0:r 5 + 
0a; 4 -|-0a; 3 +0a; 2 -|-0a; 1 — 1?) and computer algebra systems, al- 
gorithms to do more than add and multiply are scarce on the 
ground, and most texts slip silently from considering sparse 
polynomials to considering dense ones [8]. This is partly 
because of the existence of examples showing that the out- 
put can be exponentially larger than the input, and hence 
"nothing can be done". We contend that these examples are 
basically all cases of the cyclotomic polynomials in disguise, 
and that, by admitting these to the output language, as 
Schinzel's if-operator [18] effectively does, these examples 
cease to be absolute barriers to efficient algorithms. Cyclo- 
tomic factors can often be recognised relatively efficiently 
[BJ, though the worst-case is NP-hard from this result. 

Theorem 1 ([TBI Theorem 6.1]). It is NP-hard to solve 
the problem, given a polynomial p(x) £ 1\x\, to determine if 
p has a root r of modulus 1. 
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This is a paradigmatic example of a more general thesis: 
solving problems in computer algebra requires the concur- 
rent design of the most appropriate vocabulary and algo- 
rithms which are polynomial in the size of the output so en- 
coded. Naturally, unconstrained multiplication of new vo- 
cabulary is not a viable solution, and a methodology for 
costing this was proposed in [7]. 

A common problem in computer algebra is "factorize this 
polynomial". The algorithms commonly used first compute 
factorizations p-adically, and then deduce the "true" factor- 
ization over Z. The traditional approaches [23] are theoret- 
ically exponential in the number of p-adic factors, though 
in practice the exponential aspect can be "controlled" pQ. 
Polynomial-time (in the degree, and a fortiori in the number 
of p-adic factors) algorithms are known [12] , but in practice 
tend to be slower. The most recent progress is in 20 , whose 
algorithm is faster in practice, and the deduction phase is 
heuristically polynomial time in the number of p-adic fac- 
tors. 

Of course, for a sparse polynomial such as x n — 1, the size 
of the polynomial is O(logn), and so an algorithm polyno- 
mial in n is still exponential in the size of the input. Are 
there algorithms which are polynomial in the size of the in- 
put? If the output represents the factors as expanded poly- 
nomials, this is impossible. There is however a conjecture 
that the only cases which cause exponential blowups are cy- 
clotomic factors - we will return to this later. 

Notation 1. We define the following for a polynomial 

n rt 

f = a t x' = a„ Y\i x ~ a i) '■ 

i=0 i=l 

#/ the number of non-zero terms in /, \{i : ai 0}|; 

/_! =x"/(i/z) = Er=o a "-^; 
/- =/(-*) = ££=„*(-*)'; 

f e the even part of /, YJl^o a 2i x *; 
fo the odd part of /, Y^"lo a-2i+ix 1 ; 

fi the root-square, or GraeffcQ, of /, ±fflnn™=i( x — a ?) 

.' ' Xfo 5 

H/lloo = max™ =0 [o<|; 

There are various conventions in the literature as to how 
one handles the ± arising from the parity of n. 



\\f\h =x/EL^?; 
Il/lli = EIUM; 

M(f) (the Mahler measure of /) = \a n \ Yl" = i rnax(l, |ai|). 

2. CYCLOTOMIC POLYNOMIALS 



Cyclotomic polynomials are the bugbear of anyone who tries 
to deal with sparse polynomials. 

Example 1. Asking for the factorization, or even the de- 
grees of the factors, of x n — 1 is tantamount to factoring n, 
since for every prime p dividing n, there is an x v ~ x + ■ ■ ■ in 
the factorization of x n — 1. 



Notation 2. Let d(n) denote the number of divisors of 
the number n (including 1 and n itself). 



(1) 



Theorem 2. It is known \22j that 

d(n) < n dog2+<»(l))/loglogr 



However, we should note the caveats about the distribution 
of d(n) given at [111 Theorem 432], in particular that it has 
average order logn but normal order roughly (logn) g2 . 

Definition 1. We will say that a polynomial is cyclo- 
tomic, if all its roots are roots of unity. Many authors re- 
serve this for irreducible polynomials, but we will explicitly 
say "irreducible" when we need to. 

Notation 3. Let $fc be the k-th irreducible cyclotomic 
polynomial: 



$*(*)= n (*-• 



2nij/h\ 



(2) 



gcd(j',fe)=l 



We denote by C„ the cyclotomic polynomial with all n-th 
roots of unity, i.e. C„ — x n — 1. 

We should note that it is not the case that the coefficients 
of $fe are or ±f. The first counterexample is $105, which 
contains the terms — 2x 7 and — 2x 41 . $385 contains the 
terms —3a; 120 and —3a; 121 The growth rate is in fact greater 
than one might expect, and $15015 has terms of 23 a; 2294 and 
23 z 3466 . 150 15 = 3 • 5 • 7 • 11 • 13, and this looks like the 
recipe (confirmed in [4]) to make large valuefl, but in fact 
23 is first attained at 11305 = 5 ■ 7 • 17 ■ 19, as shown in table 
[T] We note the spectacular leap at 40755 = 3 ■ 5 • 11 • 13 • 19, 
which is the largest coefficient up to k — 80,000. 2, Table 
3] shows more such leaps for (much) larger n. 

Theorem 3. JI71 Theorem 1] shows that, for infinitely 
many n, 



log ||$n||cx> > exp 



/ (log 2) (logn) 
V log log n 



(3) 



and indeed this is precisely the right order of (worst-case) 
growth 3 , perhaps better expressed as 



lim sup 



$r, 



log nj log log? 



= log2. 



(4) 



Proposition 4. x" — 1 — Yld\n ®d(x), and these factors 
are irreducible. 

Proposition 5. $„(x) = Yl d ^„{x d - i)^ n / d \ where u is 
the Mobius function. 

Proposition 6. x" — 1 has d(n) irreducible factors. 



Example 2. Similarly, asking for the degree of <&k is, if 
k is p ■ q (p, q distinct primes), tantamount to factoring k, 
since <f>(k) = (p — l)(q — 1) and so 

P, q = - (k + 1 - 4>{k) ± ^/k 2 -2k- 2k(/>(k) + {(j>{k) - l) 2 ) . 

Cyclotomic polynomials are frequently used as examples. 
Example 3. J20\ p. 185] gives this example 

128 112 .80 64 . 48 16 , , 

X —X +X —X +X —X +1, 



2 It does give us (exactly) 500 at 255255 = 3 ■ 5 ■ 7 ■ 11 • 13 ■ 17. 



and states that his algorithm sped up Maple by a factor of 
500 on this example. From a cyclotomic-aware point of view, 
such as £5)', this polynomial is easy. Four applications of 
Graeffe's root-squaring process show (as is obvious to the 
eye) that this is f{x 16 ) where 

cl 1 8 7.5 4.3 ,i 

f[x) = X —x +X —X +X — X + 1. 

Another application takes f to itself, and hence f, and so 
the original polynomial, is cyclotomic. If a is a root of f , 
a 15 = 1, so f — $15, and the original polynomial is 

$15$30$60$120$240- 

Example 4. If p is prime, f = x p — 1, then #/ = 2 but 
f — (x — l)(x p_1 + ■ ■ ■ + 1); two factors with 2 and p terms 
respectively. 

Example 5. Ifp, q are distinct primes, f = (x p — l)(x q — 
1), then #/ = 4 but f = (x - l) 2 ^ 1 + ■ ■ ■ + l)(x 9_1 + 
■ ■ ■ + 1). The square-free decomposition of f is therefore a 
repeated factor with 2 terms and a factor g with pq — p — q + 2 
terms respectively. The largest coefficient in g is min{p, q}, 
and ||ff||i = pq. 

Obviously, a square-free decomposition of / was a bad idea 
in this case: however previously-proposed algorithms, e.g. 
p. 69] tend to do this. 

It could be argued that the problem in this case is the 
'cofactor', but life is not that simple. 

Example 6. Ifp, q are distinct primes, f — (x p —l) 2 (x q — 
1), then #/ = 6 but the square-free factorization is 

(a:-l) 3 (» ,, - 1 + »- + l) a (a: < - 1 + -» + l) 1 

and we are forced to write out the large squared factor. The 
largest coefficient of (s p_1 + ■ ■ ■ + l) is p, so we had also 
better not compute it and then take its square root. 

It is the contention of this paper that all these difficulties 
except the first are caused by an inadequate vocabulary: the 
first seems to be intrinsic, in the fact the factorization of 
numbers can be encoded as a problem of factorization of 
polynomials. All we can do is recognise the fact. 



Table 1: Large coefficients in <3>fc 
[ai| 2 3 4 5 6 7 8=9 14 23 25 27 59 359 

first $ fe 105 385 1365 1785 2805 3135 6545 10465 11305 17225 20615 26565 40755 
4>{k) 48 240 576 768 1280 1440 3840 6336 6912 10752 12960 10560 17280 

[A larger version, independently computed, is in [2].] 



3. REPRESENTATIONAL COMPLEXITY 

Information theory, whether through the guise of Kol- 
mogorov Complexity [13] or Minimum Description Length 
[10] . tell us that good representations of structured objects 
are two-part codes: a model and an encoding of data using 
that model. In other words, the proper "length" of an ob- 
ject consists in counting the length of the representation of 
the model as well as the representation of the data encoded 
using this model. 

In [7] , these results from information theory are rephrased 
so as to apply more directly to Computer Algebra Systems, 
and applications to simplification are outlined. The basic 
result is that for large enough structured expressions, it 
is always worthwhile to first formalize the "structure", and 
then encode the data in such a way as to abstract out that 
structure. Note that, if model extensions are not allowed, 
then simplification reduces simply to length reduction. It 
is exactly the confusion between issues of the (background) 
model-class and its use in model reduction which caused 
Moses [15 to argue that "simplification" was impossible to 
formalize. 

When tackling a particular situation, 7; boils down to 
finding the right vocabulary in which to express ones' result. 
In some cases, the right vocabulary is somewhat counter- 
intuitive. For example, in the case of algebraic numbers, it 
was long ago discovered that using minimal polynomials to 
"encode" an algebraic number was best - although this can 
seem puzzling in the setting of "solving" a polynomial, as 
then the answer to the problem is just an encoding of the 
question. We will return to this issue later. 

For the particular case of factoring of polynomials, what 
does this tell us? All the theoretical results point in the 
same direction: cyclotomic polynomials are somehow "spe- 
cial cases", especially when one is factoring sparse polynomi- 
als. Conventional wisdom already tells us that both dense 
and sparse polynomials are useful model classes, and that we 
should have both at our disposal. The mathematical theory 
of factoring polynomials (as outlined in the rest of this pa- 
per) informs us that cyclotomics are undeniably part of the 
domain of discourse. Combining these together tells us that 
they should also be part of our model classes. The only 
remaining question then is whether adding this particular 
vocabulary actually leads to a simplification. To evaluate 
this, we need to actually display some data structures de- 
signed with this new vocabulary, and then evaluate if we 
have made any real gains. 

4. DATA STRUCTURES 

Since we will be arguing on the size of data structures, 
we will need to define our data representations. The precise 
details might vary, though in practice the conclusions will 
not. For definiteness, we describe our choices according to 
the unaligned packed encoding of ASN.l [19]: note that their 
SEQUENCE is what C programmers would think of as struct. 
Our encodings are intended to be practical, though we ignore 



issues of alignment to word boundaries, and indeed a number 
of [~. . .] operators are also omitted. 

In theory, one needs to have arbitrary sized data fields, 
which means one needs fields for the length of the size fields. 
To avoid this, we will assume that N and K are global pa- 
rameters for the size of the object, bounding the degrees 
and the size of the coefficients. Since a polynomial's factors 
always have smaller degree, we can assume N = log 2 n. It 
is not so easy for the coefficients [14], but we will assume a 
single field of K bits associated with each outermost data 
structure, giving the size k of all the coefficients stored in 
that structure. Since there is one of these, we can ignore its 
cost. 

We next give explicit representations for dense polynomi- 
als, sparse polynomials, factored polynomials, $-aware fac- 
torizations and C-aware factorizations (explained fully be- 
low). 

4.1 A single dense polynomial 

We choose a dense representation with a uniform size 
bound for all the coefficients. A single dense polynomial 
of degree n requires log 2 n bits to represent the degree, and 
then there are n + 1 coefficients. Hence if each of them re- 
quires k bits, we need log 2 k bits for the telling us this, and 
then the coefficients require k + 1 bits (including sign). 

(fc + l)(n + l) + log 2 fc + log 2 n (5) 

In pseudocode^ this might be represented as follows. 

DensePoly ::= SEQUENCE { 

Degree INTEGER ( 



0. 



0. 



k INTEGER 
Coefficients SEQUENCE { 

INTEGER 



} 



} SIZE ( Degree + 1 



4.2 



A single sparse polynomial 

We choose a sparse representation with a uniform size 
bound for all the coefficients. Furthermore, we assume that 
there are t non-zero coefficients, i.e. t terms to be repre- 
sented, and t is bounded by the same bound as the degree. 
A single sparse term from a polynomial of degree n requires 
log 2 n bits to represent the degree. Hence the total space is 
given by 



log 2 n + t(k + 1 + log 2 n). 
In pseudocode, this might be represented as 
SparsePoly ::= SEQUENCE { 

TermCount INTEGER (I . . . 2^ 1 



(6) 



Essentially ASN.l, except that we allow ourselves to write 
mathematics, enclosed in boxes, in the pseudocode. 



Terms SEQUENCE { 

Degree INTEGER ( 



0. 



} 



Coefficient INTEGER 
} SIZE (TermCount) 



Using a Horner scheme might save a few more bits in the 
representation of the exponent, but rarely appreciably so. 

4.3 Representing Factorizations 

We will use the same structure for square- free or complete 
factorizations: the number of (distinct) factors followed by 
pairs (multiplicity, factor). In pseudocode, this might be 
represented as 

Factorization ::= SEQUENCE { 
Factor-Count INTEGER ( 



0. 



Factors SEQUENCE { 

Multiplicity INTEGER ( 
Factor 



Poly 



} 



} SIZE (FactorCount 



where Poly is one of DensePoly or SparsePoly. Hence with 
/ factors, the overhead (i.e. the cost over and above that of 
storing the distinct factors themselves) is 



(/ + l)log 2 n. 



(7) 



4.4 Representing $-aware Factorizations 

We will use the same structure for square- free or complete 
factorizations: the number of (distinct) factors followed by 
pairs (multiplicity, factor) . However, we do this twice: once 
for the factors that are and once for those that are not. 
The factors that are <&k are stored as k followed bjQ <j)(k). 
In pseudocode, this might be represented as 

Factorization ::= SEQUENCE { 

PhiFactorCount INTEGER ( 0. 
PhiFactors SEQUENCE { 

Multiplicity INTEGER ( 

k INTEGER ( 



0. 



0. 



Degree INTEGER ( 
} SIZE (PhiFactorCount; 

FactorCount INTEGER ( . . . 2 A 
Factors SEQUENCE { 

Multiplicity INTEGER ( 
Factor 



0. 



Poly 



} 



} SIZE (FactorCount 



where Poly is one of DensePoly or SparsePoly. Hence with 
I $fc factors, the cost of storing them is (3/ + 1) logn. Any 
<3>/s stored this way is cheaper than in any of the previous 
representation (dense, sparse or factored), so the worst case 
is when there are no $fe in the factorization, when the over- 
head is merely the one field PhiFactorCount. 



4 The inclusion of 4>(k) avoids the problem in example [2] 



4.5 Representing c-aware Factorizations 

Instead of storing the we could store the complete cy- 
clotomic polynomials x k — 1. Again, we will use the same 
structure for square-free or complete factorizations: the num- 
ber of (distinct) factors followed by pairs (multiplicity, fac- 
tor). Also in this case, we do this twice: once for the factors 
that are of the form x k — 1, and once for those that are not. 
One might think to store the factors that are x k — 1 simply 
as k. However, as pointed out in example [T] this will not 
allow us to answer questions such as "how many factors" in 
a reasonable time. Hence we store k followed by its prime 
factorization. 

We should also note that Proposition [S] means that we 
now need negative multiplicities as well. In pseudocode, this 
might be represented as 

Factorization 



SEQUENCE { 

CFactorCount INTEGER ( 0.. 
CFactors SEQUENCE { 

Multiplicity INTEGER 

Degree INTEGER ( 



-2 r 



0. 



NumFactors INTEGER ( 
KFactor SEQUENCE { 

INTEGER ( 



0. 



0. 



} SIZE (NumFactors) 
} SIZE (CFactorCount ) , ' 



0. 



FactorCount INTEGER ( 
Factors SEQUENCE { 

Multiplicity INTEGER ( 
Factor 



Poly 



} 



} SIZE (FactorCount 



where Poly is one of DensePoly or SparsePoly. Hence with 
/ factors of x k — 1 involved, the overhead (i.e. the cost over 
and above that of storing the distinct factors themselves) is 



(/ + 1) log 2 n. 



(8) 



An alternative formulation might store the factors of k 
with multiplicity: there does not seem to be a great deal 
to choose between them. In either case, we should note 
that asking for the number of irreducible factors is no longer 
trivial, since the number of irreducible factors corresponding 
to a single x k — 1 is d(k). Furthermore, since we are allowed 
negative exponents, representing 



"I / 



x K + 1 := 



we have to note that not all the irreducible factors of a; — 1 
are actually factors of the left-hand side. Nevertheless, since 
we have stored the prime factorization of k, the problem is 
efficiently soluble (certainly polynomial time in the size of 
the representation). 

5. REPRESENTING SOME CYCLOTOMIC 
POLYNOMIALS 

We now use each of our representations and compute the 
size of the results. 



Table 2: Representing x n — 1 = Yldin &<i(x) 
Representation Fully expanded Square-free factorization Factored 

l log 2 

Dense 2(n + 1) + log 2 n same as factored n i»giag» l g 2 e 

]_ log 2 

Sparse 31og 2 n same as factored n lo s lo s ™ log 2 e 

"I>fc 31og 2 n same as factored (2d(n) + 1) log 2 n 

x k — 1 3 log 2 n same as factored log 2 n 



Table 3: Representing (x p — l)(x q — 1) = (x — l) 2 <b v (x)<k q (x) with n = p + q 
Representation Fully expanded Square-free factorization Factored 

Dense 2(rt + 1) + log 2 n (1 + log 2 n)(n + 2) + 4 log 2 n 2(n + 3) + 6 log 2 n 
Sparse 4 log 2 n (2n + 2) log 2 n (n + 10) log 2 n 

$ ft 4 log 2 n 6 log 2 n 6 log 2 n 

x k — 1 4 log 2 n 2 log 2 ra 2 log 2 n 



5.1 Factorization of x n - 1 

The sizes of the expanded x n — 1 polynomial in dense, and 
sparse encodings are obvious. The <I>-aware and C-aware 
versions are within an additive constant of DensePoly / 
SparsePoly. For definiteness, we will use SparsePoly based 
counts. 

To understand the size of the factored forms, we need 
to study their sizes a little more closely. In this case, the 
factored form and the square-free form coincide. There are 
d{n) factors, of total degree n, hence n + d(n) terms. By 
Theorem [3] we can bouncQ the size of the coefficients as 
log 2 e times the right-hand side of (J3j , viz. 
the size of a dense polynomial to be 



log log n ' 



(n + d(n)) log 2 eexp 



/ (log 2) (log n) 
V log log n 



+ d(n) log 2 n 



' log log » j g 2 e _ (g) 

In genera^, the factors will be essentially dense, so a 
sparse encoding will save nothing, but have to pay for the 
cost of storing the degrees with each coefficient, adding (n + 
d(n)) log 2 n, to give 



(n+d(n)) ^log 2 n + lo. 



€2 e ex P 



(log 2) (log r 
log log n 



"log log n i g 2 



-d(n) log 2 n 



(10) 



We should note that the asymptotically dominant term is 
the coefficient storage in this model, which is contrary to 
intuition, and even the experimental data in table [T] but 
this merely shows that the asymptotics will take time to be 
visible. 

The results of this section are summarised in Table [2] 

5.2 A square-free factorization 

Let use now consider (x p — l)(x 9 — 1), with p, q distinct 
primes. The "fully expanded" versions are again obvious. 
The square- free factorization of (x p — l)(x q — 1) = (x — 
l) 2 $ p (a;)$ 9 (x) involves multiplying out & p (x)<b q (x). This 
gives us coefficients of size O(n), in fact n/2 assuming that 



5 In theory, not all the factors can have coefficients this large, 
but the gain from exploiting this is relatively small. 

6 x 2 — 1 is an obvious counter-example. 



p, q are balanced, taking (log 2 n) — 1 bits to represent the 
magnitud^D- 

In the factored representation, we have three factors, of 
degrees 1, p — 1 and q — 1, i.e. total degree n — 1. All 
coefficients are bounded by 1. Hence the total is 

(n-l+3)(log 2 n+2) + (3-2+2) log 2 n w (n+10) log 2 n. (11) 

The results of this section are summarised in Table [3] 

6. IMPLEMENTATION NOTES 

6.1 Cyclotomic-free 

It is important to review the encodings of the previous sec- 
tion and notice that for cyclotomic-free cases, these encod- 
ings involve constant overhead, independent of the degree 
and of the number of factors. In fact, by using a bit or two 
in a header word (which modern computer algebra systems 
always seem to use in their internal representations), one 
can choose between these encodings as necessary. In other 
words, cyclotomic-free polynomials do not have to bear any 
extra representation cost for this vocabulary extension. 

We can also construct various mixed cases, in other words 
sparse polynomials which factor into a cyclotomic part and 
a small dense cofactor. The difference in encoding cost is 
correspondingly mixed, although the end result is similar: 
adding cyclotomics asymptotically wins. 

6.2 Which to choose? 

We have posited two encodings for "cyclotomic-aware" 
representations of factorizations: one in terms of the irre- 
ducible cyclotomics (section l4.4|l and one in terms of the 
'complete' cyclotomics Ck = x k — 1 (section I4.5[) . Tables [2] 
and make it clear that adding cyclotomics to ones' vocab- 
ulary is certainly reprensentionally efficient. But which one 
should be used? We first summarize some the advantages 
and disadvantages. 

Pro $fe: clearly polynomial In the ^-representation, the 
factorization of x p+q +x p +x q + l is $!$2p$2 9 , where as 
in the Cfc-representation it is C2 P C~ 1 C2 g C~ 1 . Func- 
tions meant to extract information from products of 
polynomials (degrees, multiplicity, etc) still function 



7 In this case, they are all positive, but we can't count on 
this in general. 



'easily', whereas spotting that the Cfc-representation 
is not square-free is cheap, but not 'obvious'. 

Anti worst-case blowup The last two entries of ta- 
ble [2] show that, for highly composite n, represent- 
ing C n in terms of $£, can require almost n times as 
much, i.e. exponentially more, since the input has size 
O(logn), space, by Theorem [2] 

Notes The pragmatist would probably choose <&. The the- 
oretician would be swayed by the complexity argument 
and want C. Possibly the best answer is to admit <E>, 
and C with an additional extension to the vocabulary 
as in (fT2l . 

To further illustrate one particular difficulty, let us con- 
sider factoring of C105 • It factors into 

$1 (x)$ 3 (x)$ B (x)$ 7 (x)$ 15 (z)$2i (x)$> 35 0)$i05 (x). 

But if all we have at our disposal is C, then the best we can 
do (which is still better than using sparse polynomials) is 

Gsjx) C 5 (x) C 7 (x) C 15 (x)C 1 (x) C 2 i(x)C 1 (x) 
1{X) Cx(x) Gx(x) d(x) C 3 (x)C 5 (x) C 3 (x)C 7 (x) X 

C 35 ( x )C\{x) C W5 {x)C 7 (x)C 5 {x)C 3 (x) 



C 5 {x)C r (x) C 35 (x)C 2 i {x)Cxb (x)Ci (x) 
An even more succinct representation is 

105 

n 

k = 1 

fc|105 



(12) 



at the cost of another vocabulary extension. This is why in 
the representation of C in sect ion |4~5I we list the factors of n, 
which allows us to recover this factorization relatively easily. 
This still means that in the <E>-aware encoding, obtaining 
the number of factors, the degrees of each of the factors, 
or the multiplicity of each of the factors is straightforward, 
while for the C-aware encoding, these simple questions now 
require some (small) amount of computation. , and so forth. 
All of these questions remain just as easy to answer in the 
$-aware encodings as they were before. However, for the C- 
aware encodings, these "simple" questions now require some 
actual computations to resolve. In other words, adding $ to 
our vocabulary is a very minor change with clear efficiency 
gains, while adding C is slightly disruptive but with even 
greater asymptotic efficiency gains. 

7. CONCLUSION 

In section [5] we have shown how some "troublesome" fac- 
torizations, i.e. examples [1] and [5l cease to consume inordi- 
nate space when the cyclotomics are represented explicitly. 
But what of arbitrary polynomials and their factorizations? 

The answer is that we do not know, but there are some 
tantalizing results. [9] state that, provided / is non-reciprocal 
(/ 7^ / nas a factor with at most C2(#/, ]|/|]oo) 

terms, independent of n, and quotes [T7] to say that if the 
polynomial has no reciprocal factors, then all irreducible fac- 
tors have at most c 2 (#f, ||/||oo) terms. This is a significant 
step towards controlling the dependence of the output size 
on n, though it falls short of saying that is polynomial in 
the following ways: 



• there is no guarantee that C2 depends polynomially on 

or log 1 1 /| 1 00, and indeed C2 is still rather mysteri- 
ous to the authors [8] Challenge 4]; 

• nothing is said about the size of the coefficients, and all 
known technology makes them depend exponentially 
on n (i.e. the output size depends linearly on n). 

We note, though, that the known examples of such growth 
|14l §3], depend on cyclotomic polynomials, so one could 
hope that the second problem does not occur in practice. 

We are convinced, and we hope to have convinced the 
reader, that, regardless of whether using cyclotomics ulti- 
mately makes the problem of sparse polynomial factoriza- 
tion more tractable, even if not guaranteed polynomial (see 
Theorem [T]), in the input size, that they are most definitely 
worth having in the basic vocabulary used for the output of 
factoring. We strongly recommend that the specification for 
what it means to factor a polynomial be thus amended. 
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